Abstract. This paper addresses the problem of verifying whether two matrices
are unitarily similar. For the case of nonderogatory matrices, we present a
new solution to this problem based on a geometric approach. The main advantage
of this approach is its stability with respect to errors in the initial upper
triangular matrix. Since an upper triangular form is usually obtained by
approximate methods (e.g., by the QR algorithm), this advantage is all the more
significant and allows us to propose a numerically stable and efficient method
for verifying unitary similarity.
1. Introduction



Matrices $A, B \in \mathbb{C}^{n \times n}$ are unitarily similar if a similarity transformation
between them can be implemented using a unitary matrix $U$:

$$B = U A U^*. \tag{1.1}$$

A matrix $A \in \mathbb{C}^{n \times n}$ is called nonderogatory if its Jordan blocks have distinct
eigenvalues. Equivalently, a matrix $A \in \mathbb{C}^{n \times n}$ is nonderogatory if and only if
its characteristic polynomial and minimal polynomial coincide.

This paper concerns the verification of matrices for unitary similarity. Based 
on other authors' works concerning this problem, two basic approaches can be 
identified. 

In the first, a complete system of matrix invariants under a unitary similarity
transformation is constructed. In a sense, the final result in this direction is
the Specht-Pearcy criterion (see [1, 2]), which reduces the question to verifying
conditions of the form

$$\operatorname{tr} W(A, A^*) = \operatorname{tr} W(B, B^*) \tag{1.2}$$

for all words $W(s, t)$ of length at most $2n^2$. However, it seems that the number
of words to be verified is strongly overestimated (see [3-6]). Moreover, this
method does not produce a unitary matrix implementing a given unitary similarity.
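The trace conditions (1.2) can be checked numerically for small matrices. The following is a minimal sketch in pure Python, not an implementation taken from the cited works: the helper names (`matmul`, `adjoint`, `word_traces_agree`) and the tolerance `tol` are assumptions introduced for illustration, and the word-length cap is passed in explicitly as `max_len`.

```python
from itertools import product

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def adjoint(X):
    n = len(X)
    return [[X[j][i].conjugate() for j in range(n)] for i in range(n)]

def trace(X):
    return sum(X[i][i] for i in range(len(X)))

def word_traces_agree(A, B, max_len, tol=1e-9):
    """Compare tr W(A, A*) with tr W(B, B*) for every word W of length <= max_len."""
    n = len(A)
    identity = [[complex(i == j) for j in range(n)] for i in range(n)]
    # Letter 0 stands for the matrix itself, letter 1 for its adjoint.
    letters = ((A, B), (adjoint(A), adjoint(B)))
    for length in range(1, max_len + 1):
        for word in product(range(2), repeat=length):
            WA, WB = identity, identity
            for letter in word:
                WA = matmul(WA, letters[letter][0])
                WB = matmul(WB, letters[letter][1])
            if abs(trace(WA) - trace(WB)) > tol:
                return False
    return True

# B = X A X* with X = diag(i, 1), so A and B are unitarily similar.
# C has the same eigenvalues as A but a different off-diagonal magnitude.
A = [[1 + 0j, 2 + 0j], [0j, 3 + 0j]]
B = [[1 + 0j, 2j], [0j, 3 + 0j]]
C = [[1 + 0j, 5 + 0j], [0j, 3 + 0j]]
n = len(A)
print(word_traces_agree(A, B, max_len=2 * n * n))
print(word_traces_agree(A, C, max_len=2 * n * n))
```

The number of words grows as $2^{\text{length}}$, which illustrates why this criterion is costly in practice and why it yields no transforming matrix $U$.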

The second approach is free of this shortcoming and consists of constructing a
canonical form of matrices with respect to unitary similarity transformations.
Inductive definitions of the canonical form of a matrix were proposed in [7-9],
but it is hard to visualize the final canonical form. In the more recent work [10],
the authors restricted attention to the set of nonderogatory matrices and
constructed a more transparent canonical form.

In this work we propose a geometric approach to solving the problem for
nonderogatory matrices. Given an arbitrary nonderogatory matrix, we construct
a finite family of matrices unitarily similar to it (this family is called
canonical). Whether or not two matrices are unitarily similar can be answered
by checking whether their corresponding families intersect. This method for
unitary similarity verification has a significant advantage over the method of [10],
which is based on the construction of a canonical form: it is stable with respect
to errors in the initial matrix. This last aspect is discussed at the end of this paper.

While constructing a canonical family, we start from an upper triangular matrix
form. Specifically, by the Schur theorem, any matrix $A \in \mathbb{C}^{n \times n}$ can be reduced
to such a form by using a unitary similarity transformation:

$$
\Lambda = \begin{pmatrix}
\lambda_1 & \Lambda_{12} & \Lambda_{13} & \cdots & \Lambda_{1n} \\
 & \lambda_2 & \Lambda_{23} & \cdots & \Lambda_{2n} \\
 & & \lambda_3 & \cdots & \Lambda_{3n} \\
 & & & \ddots & \vdots \\
 & & & & \lambda_n
\end{pmatrix}, \tag{1.3}
$$

where $\lambda_1, \dots, \lambda_n$ are the eigenvalues of the matrix, repeated according to
multiplicity, in some fixed order. The next statement lets us restrict the set of
unitary transformations when working with nonderogatory triangular matrices.

Lemma 1.1. Let $A$ be a nonderogatory complex $n \times n$ matrix, and let $\Lambda$ be
its upper triangular form with eigenvalues $\lambda_1, \dots, \lambda_n$ on the diagonal in some
fixed order. Then the magnitudes of the elements $\Lambda_{ij}$, where $i < j$, are uniquely
determined.



Proof. Since the analogous proposition for matrices with simple eigenvalues is
known [8], we can consider the case of a nonderogatory matrix with a single
eigenvalue $\lambda$. Let $\Lambda$ be obtained from $A$ by a unitary similarity transformation

$$\Lambda = Q^* A Q, \tag{1.4}$$

where $Q = (q_1 q_2 \cdots q_n)$ is a unitary matrix. Rewriting equation (1.4) as $AQ = Q\Lambda$,
one sees that $q_1$ is a normalized eigenvector of the matrix $A$ corresponding
to the eigenvalue $\lambda$. Further,

$$A q_2 = \Lambda_{12} q_1 + \lambda q_2; \tag{1.5}$$

hence $(A - \lambda E) q_2 = \Lambda_{12} q_1$ and $(A - \lambda E)^2 q_2 = 0$, i.e., $q_2$ is a generalized
eigenvector of $A$. Adding the condition of orthonormality of the pair $q_1, q_2$, one
obtains that $q_2$ is uniquely determined up to multiplication by a scalar of unit
modulus. Continuing in the same vein, we see that the matrix $Q$ is uniquely
determined up to multiplication by a diagonal unitary matrix, and such
transformations preserve the magnitudes of the off-diagonal elements of the upper
triangular form $\Lambda$. Thus the lemma is proved.

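Using the last lemma, we can limit our consideration to studying the action of
the group of unitary similarity transformations with diagonal matrices on the
set of upper triangular matrices:

$$\Lambda \mapsto X \Lambda X^*, \qquad X = \operatorname{diag}(e^{i\psi_1}, \dots, e^{i\psi_{n-1}}, 1), \tag{1.6}$$

$$
\{X \Lambda X^*\}_{ij} = \begin{cases}
\Lambda_{ij}\, e^{i(\psi_i - \psi_j)}, & i < j < n, \\
\Lambda_{ij}\, e^{i\psi_i}, & i < j = n, \\
\lambda_i, & i = j, \\
0, & i > j
\end{cases} \tag{1.7}
$$

(by assuming that the last diagonal entry of $X$ is 1, we remove a scalar factor
from $X$).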
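The action (1.6)-(1.7) is easy to check numerically. The sketch below is illustrative only: the function name `conjugate_by_diagonal` and the sample matrix `Lam` are assumptions, not objects from the paper. It conjugates an upper triangular matrix by $X = \operatorname{diag}(e^{i\psi_1}, \dots, e^{i\psi_{n-1}}, 1)$ and confirms that magnitudes are preserved while the argument of the entry $(i, j)$ shifts by $\psi_i - \psi_j$.

```python
import cmath

def conjugate_by_diagonal(Lam, psi):
    """Return X Lam X* for X = diag(exp(i psi_1), ..., exp(i psi_{n-1}), 1)."""
    n = len(Lam)
    phases = [cmath.exp(1j * p) for p in psi] + [1 + 0j]  # last entry of X is 1
    return [[phases[i] * Lam[i][j] * phases[j].conjugate() for j in range(n)]
            for i in range(n)]

Lam = [[1, 2 + 1j, -3j],
       [0, 2, 0.5],
       [0, 0, 3]]
M = conjugate_by_diagonal(Lam, psi=[0.7, -1.2])

# Magnitudes of all entries are preserved.
assert all(abs(abs(M[i][j]) - abs(Lam[i][j])) < 1e-12
           for i in range(3) for j in range(3))
# Diagonal entries are unchanged.
assert all(abs(M[i][i] - Lam[i][i]) < 1e-12 for i in range(3))
# The argument of entry (1, 2) shifts by psi_1 - psi_2 = 0.7 - (-1.2).
shift = cmath.phase(M[0][1] / Lam[0][1])
print(shift)
```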
2. Preliminary constructions



Let $M$ denote the range of the parameters of the matrix,

$$M = \{(r_{12}, \dots, r_{n-1,n};\ \varphi_{12}, \dots, \varphi_{n-1,n}) : r_{ij}, \varphi_{ij} \in \mathbb{R}\}, \tag{2.1}$$

and let $M_r$ denote its restriction for fixed $r$:

$$M_r = \{(r; \varphi) \in M : r_{ij} \text{ are fixed}\}. \tag{2.2}$$



The indices $i$ and $j$ run over the values $1 \le i < j \le n$ and are ordered
lexicographically. For elements of $M$ and $M_r$, several equivalent forms of notation
are used:

$$(r_{12}, \dots, r_{n-1,n};\ \varphi_{12}, \dots, \varphi_{n-1,n}) = (r;\ \varphi_{12}, \dots, \varphi_{n-1,n}) = (r; \varphi). \tag{2.3}$$

Looking ahead, $r_{ij}$ and $\varphi_{ij}$ will later play the role of absolute values and
arguments of off-diagonal elements of $\Lambda$. Despite this geometric interpretation, no
constraints are as yet imposed on $r_{ij}$ and $\varphi_{ij}$, and the indeterminacy of $\varphi_{ij}$ at
$r_{ij} = 0$ is ignored. At this stage, we work with the formally defined range $M$.

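On $M$ we introduce the family of transformations

$$
X_\psi : (r; \varphi) \mapsto (r; \tilde\varphi), \qquad
\begin{aligned}
\tilde\varphi_{ij} &= \varphi_{ij} + \psi_i - \psi_j, & 1 \le i < j \le n-1, \\
\tilde\varphi_{in} &= \varphi_{in} + \psi_i, & 1 \le i \le n-1.
\end{aligned} \tag{2.4}
$$

Each such transformation is defined by a parameter vector $\psi = (\psi_1, \dots, \psi_{n-1}) \in \mathbb{R}^{n-1}$.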



(2.5) 



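Consider a subset $K \subset M$ whose elements satisfy the system of
equations

$$-\sum_{k=1}^{s-1} r_{ks}\varphi_{ks} + \sum_{k=s+1}^{n} r_{sk}\varphi_{sk} = 0, \qquad s = 1, \dots, n-1. \tag{2.6}$$

The summation indices in (2.6) are visually described by the diagram

$$
\begin{pmatrix}
\lambda_1 & & * & & \\
 & \ddots & \vdots & & \\
 & & \lambda_s & * \ \cdots & * \\
 & & & \ddots & \\
 & & & & \lambda_n
\end{pmatrix}, \tag{2.7}
$$

where the asterisks mark the entries of column $s$ above the diagonal (the first sum)
and the entries of row $s$ to the right of the diagonal (the second sum).
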

The reduction of an arbitrary element of $M$ to the form $K$ by applying a
transformation $X_\psi$ amounts to finding the parameters $\psi = (\psi_1, \dots, \psi_{n-1})$ of this
transformation by solving the system of linear equations

$$R(r)\psi = -b(r, \varphi) \tag{2.8}$$

with the symmetric matrix

$$
\{R(r)\}_{ij} = \begin{cases}
-r_{ij}, & i < j, \\
\sum_{k=1}^{i-1} r_{ki} + \sum_{k=i+1}^{n} r_{ik}, & i = j,
\end{cases} \tag{2.9}
$$

and with a right-hand side that is linear in $r$ and $\varphi$:

$$
b(r, \varphi) = (b_1(r, \varphi), \dots, b_{n-1}(r, \varphi)), \qquad
b_s(r, \varphi) = -\sum_{k=1}^{s-1} r_{ks}\varphi_{ks} + \sum_{k=s+1}^{n} r_{sk}\varphi_{sk}, \quad s = 1, \dots, n-1. \tag{2.10}
$$
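As a worked instance, for $n = 3$ the parameter vector is $\psi = (\psi_1, \psi_2)$ and the system can be written out explicitly. The display below is a sketch obtained by substituting the transformation (2.4) into the conditions (2.6); it is meant only to make the structure of $R(r)$ and $b(r, \varphi)$ concrete.

```latex
\[
R(r) = \begin{pmatrix} r_{12} + r_{13} & -r_{12} \\ -r_{12} & r_{12} + r_{23} \end{pmatrix},
\qquad
b(r, \varphi) = \begin{pmatrix} r_{12}\varphi_{12} + r_{13}\varphi_{13} \\ -r_{12}\varphi_{12} + r_{23}\varphi_{23} \end{pmatrix},
\]
so that the system $R(r)\psi = -b(r, \varphi)$ reads
\begin{align*}
(r_{12} + r_{13})\,\psi_1 - r_{12}\,\psi_2 &= -r_{12}\varphi_{12} - r_{13}\varphi_{13}, \\
-r_{12}\,\psi_1 + (r_{12} + r_{23})\,\psi_2 &= r_{12}\varphi_{12} - r_{23}\varphi_{23}.
\end{align*}
```

The first equation is the condition $r_{12}\tilde\varphi_{12} + r_{13}\tilde\varphi_{13} = 0$ and the second is $-r_{12}\tilde\varphi_{12} + r_{23}\tilde\varphi_{23} = 0$, with $\tilde\varphi_{12} = \varphi_{12} + \psi_1 - \psi_2$, $\tilde\varphi_{13} = \varphi_{13} + \psi_1$, and $\tilde\varphi_{23} = \varphi_{23} + \psi_2$.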

System (2.8) has some remarkable properties.

Theorem 2.1. (i) For any $\varphi_{ij}$ and nonnegative $r_{ij}$, system (2.8) has a solution;
i.e.,

$$-b(r, \varphi) \in \operatorname{Im} R(r), \qquad \forall \varphi_{ij},\ \forall r_{ij} \ge 0. \tag{2.11}$$

(ii) For all $r_{ij} \ge 0$, the determinant $\det R(r)$ is nonzero if and only if the
indices of the nonzero elements $r_{ij} > 0$ contain a collection $(ij)^1, \dots, (ij)^{n-1}$ such
that the set $\{\psi_{i^p} - \psi_{j^p}$ (respectively $\psi_{i^p}$, if $j^p = n$), $p = 1, \dots, n-1\}$
forms a linearly independent system of functions of the variables $(\psi_1, \dots, \psi_{n-1})$.

(iii) Even if $\det R(r) = 0$ for some $r_{ij} \ge 0$, the solution $\psi = (\psi_1, \dots, \psi_{n-1})$
of the equation is such that the quantities

$$r_{ij}(\psi_i - \psi_j), \quad 1 \le i < j \le n-1, \qquad \text{and} \qquad r_{in}\psi_i, \quad 1 \le i \le n-1, \tag{2.12}$$

are uniquely defined. This means that nonuniqueness in the definition of $\psi_i - \psi_j$
occurs if and only if $r_{ij} = 0$.
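The statements of the theorem can be illustrated for $n = 3$, where (2.9) gives a $2 \times 2$ matrix with diagonal entries $r_{12} + r_{13}$ and $r_{12} + r_{23}$ and off-diagonal entry $-r_{12}$. The computation below is a sketch for this special case only.

```latex
\[
\det R(r) = (r_{12} + r_{13})(r_{12} + r_{23}) - r_{12}^2
          = r_{12}r_{13} + r_{12}r_{23} + r_{13}r_{23}.
\]
% For nonnegative r_{ij} this is nonzero exactly when at least two of
% r_{12}, r_{13}, r_{23} are positive: any two of the functions
% \psi_1 - \psi_2, \psi_1, \psi_2 are linearly independent.
%
% Degenerate case r_{12} = r_{13} = 0, r_{23} > 0:
\[
R(r) = \begin{pmatrix} 0 & 0 \\ 0 & r_{23} \end{pmatrix}, \quad
b(r, \varphi) = \begin{pmatrix} 0 \\ r_{23}\varphi_{23} \end{pmatrix}
\ \Longrightarrow\ \psi_2 = -\varphi_{23}, \quad \psi_1 \text{ arbitrary},
\]
\[
r_{12}(\psi_1 - \psi_2) = 0, \qquad r_{13}\psi_1 = 0, \qquad r_{23}\psi_2 = -r_{23}\varphi_{23}.
\]
```

In the degenerate case the system is still solvable, as in (i), and the free parameter $\psi_1$ does not affect the quantities (2.12), as in (iii).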

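Proof. On the set $M_r$, we introduce the natural structure of a Euclidean space:

$$(r; \varphi^{(1)}) + (r; \varphi^{(2)}) = (r;\ \varphi^{(1)}_{12} + \varphi^{(2)}_{12}, \dots, \varphi^{(1)}_{n-1,n} + \varphi^{(2)}_{n-1,n}), \tag{2.13}$$

$$a(r; \varphi) = (r;\ a\varphi_{12}, \dots, a\varphi_{n-1,n}), \qquad a \in \mathbb{R}, \tag{2.14}$$

$$\big((r; \varphi^{(1)}), (r; \varphi^{(2)})\big) = \sum_{1 \le i < j \le n} \varphi^{(1)}_{ij}\varphi^{(2)}_{ij}. \tag{2.15}$$

Then $K_r = M_r \cap K$ is a linear subspace of $M_r$ that is orthogonal to all linear
manifolds of the form

$$G_{r,\varphi} = \Big\{ (r; \varphi) + \sum_{1 \le i < j \le n-1} r_{ij}(\psi_i - \psi_j)\, e_{ij} + \sum_{1 \le i \le n-1} r_{in}\psi_i\, e_{in} \ :\ \psi \in \mathbb{R}^{n-1} \Big\}, \tag{2.16}$$

$$e_{ij} = (r;\ 0, \dots, 0, 1, 0, \dots, 0), \tag{2.17}$$

where the unit in $e_{ij}$ stands in the position with index $(ij)$.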
The dimensions of $K_r$ and $G_{r,\varphi}$ depend on $r$, but their sum is a constant:

$$\dim K_r + \dim G_{r,\varphi} = \dim M_r. \tag{2.18}$$

In other words, in the Euclidean space $M_r$, the linear space $K_r$ and the linear
manifold $G_{r,\varphi}$ are mutually orthogonal and the sum of their dimensions is the
complete one. This implies that they have a unique intersection point
$(r; \varphi') = K_r \cap G_{r,\varphi}$. This intersection condition corresponds to the system of equations

$$R(r^2)\psi = -b(r, \varphi), \tag{2.19}$$

where $r^2$ denotes the vector

$$r^2 = (r_{12}^2, \dots, r_{n-1,n}^2). \tag{2.20}$$

In terms of $\psi$, the existence and uniqueness of an intersection point $(r; \varphi')$
means that system (2.19) is solvable with arbitrary $\varphi_{ij}$ and $r_{ij}$ and that the
values

$$r_{ij}(\psi_i - \psi_j) = f_{ij}, \quad 1 \le i < j \le n-1, \qquad \text{and} \qquad r_{in}\psi_i = f_{in}, \quad 1 \le i \le n-1, \tag{2.21}$$

are uniquely determined from it.

Assume that there exists an index set $(ij)^1, \dots, (ij)^{n-1}$ corresponding to the
nonzero elements of $R(r^2)$ such that the set $\{\psi_{i^p} - \psi_{j^p}$ (respectively $\psi_{i^p}$, if
$j^p = n$), $p = 1, \dots, n-1\}$ forms a linearly independent system of functions of the
variables $(\psi_1, \dots, \psi_{n-1})$. Then a nondegenerate system of linear equations can
be composed of relations (2.21), and $\psi = (\psi_1, \dots, \psi_{n-1})$ can be uniquely
determined. Thus, under the conditions formulated, system (2.19) has a unique
solution and, hence, $\det R(r^2) \ne 0$. The converse can be proved by contradiction.

The above results are extended to system (2.8) by making the substitution
$\tilde r_{ij} = r_{ij}^2 \ge 0$. The proof is complete.

Returning to the matrix $\Lambda$, we use Theorem 2.1 to construct the family of
matrices that are unitarily similar to $\Lambda$.

With the help of the elements of $\Lambda$, we set up the system of linear equations

$$R(r)\psi = -b(r, \varphi + 2\pi m), \tag{2.22}$$

where $r$, $\varphi$, and $m$ are defined as

$$r_{ij} = |\Lambda_{ij}|, \qquad \varphi_{ij} = \arg \Lambda_{ij} - \pi, \qquad m_{ij} \in \mathbb{Z}. \tag{2.23}$$

Note that, despite the indeterminacy of $\varphi_{ij}$ at $r_{ij} = 0$, the system of equations
is uniquely defined.

Solving this system for $\psi = (\psi_1, \dots, \psi_{n-1})$, we construct the matrix $X \Lambda X^*$,
$X = \operatorname{diag}(e^{i\psi_1}, \dots, e^{i\psi_{n-1}}, 1)$, which is unitarily similar to the original one.
Again, if for some matrix $\Lambda$ the parameter vector $\psi$ is not determined uniquely
from system (2.22), then, by Theorem 2.1, this nonuniqueness is such that the
matrix $X \Lambda X^*$ is uniquely determined.

The matrix generated by this procedure from $\Lambda$ with the parameter vector
$m = (m_{12}, \dots, m_{n-1,n})$ is denoted by $\mathcal{K}(\Lambda, m)$.
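The construction of $\mathcal{K}(\Lambda, m)$ can be sketched in a few lines of pure Python. This is a minimal illustration under stated assumptions, not the author's implementation: the names `build_system`, `solve`, and `canonical_member` are hypothetical, indices are 0-based, $\arg$ is taken in $[0, 2\pi)$, and the sketch only covers the case $\det R(r) \ne 0$ (the singular case allowed by Theorem 2.1 is not handled).

```python
import cmath
import math

def build_system(r, phi, n):
    """Assemble R(r) and b(r, phi); r and phi are dicts keyed by 0-based (i, j), i < j."""
    R = [[0.0] * (n - 1) for _ in range(n - 1)]
    b = [0.0] * (n - 1)
    for (i, j), rij in r.items():
        # Entry (i, j) enters equation s = i with sign + and equation s = j with sign -.
        b[i] += rij * phi[(i, j)]
        R[i][i] += rij
        if j < n - 1:  # the last index carries no unknown (psi_n = 0)
            b[j] -= rij * phi[(i, j)]
            R[j][j] += rij
            R[i][j] -= rij
            R[j][i] -= rij
    return R, b

def solve(R, rhs):
    """Gaussian elimination with partial pivoting; assumes det R != 0."""
    m = len(rhs)
    M = [row[:] + [rhs[k]] for k, row in enumerate(R)]
    for c in range(m):
        p = max(range(c, m), key=lambda k: abs(M[k][c]))
        if abs(M[p][c]) < 1e-14:
            raise ValueError("R(r) is singular; not handled in this sketch")
        M[c], M[p] = M[p], M[c]
        for k in range(c + 1, m):
            f = M[k][c] / M[c][c]
            for q in range(c, m + 1):
                M[k][q] -= f * M[c][q]
    x = [0.0] * m
    for c in reversed(range(m)):
        x[c] = (M[c][m] - sum(M[c][q] * x[q] for q in range(c + 1, m))) / M[c][c]
    return x

def canonical_member(Lam, m=None):
    """Sketch of K(Lam, m): solve R(r) psi = -b(r, phi + 2 pi m), return X Lam X*."""
    n = len(Lam)
    m = m or {}
    idx = [(i, j) for i in range(n) for j in range(i + 1, n)]
    r = {(i, j): abs(Lam[i][j]) for (i, j) in idx}
    # phi_ij = arg(Lam_ij) - pi with arg in [0, 2 pi), shifted by 2 pi m_ij.
    phi = {(i, j): cmath.phase(Lam[i][j]) % (2 * math.pi) - math.pi
                   + 2 * math.pi * m.get((i, j), 0) for (i, j) in idx}
    R, b = build_system(r, phi, n)
    psi = solve(R, [-v for v in b]) + [0.0]
    return [[cmath.exp(1j * (psi[i] - psi[j])) * Lam[i][j] for j in range(n)]
            for i in range(n)]

Lam = [[1, 2 + 1j, -3j],
       [0, 2, 0.5j],
       [0, 0, 3]]
# A unitarily similar copy: conjugation by X = diag(e^{0.1i}, e^{-0.2i}, 1).
X = [cmath.exp(0.1j), cmath.exp(-0.2j), 1]
Lam2 = [[X[i] * Lam[i][j] * X[j].conjugate() for j in range(3)] for i in range(3)]
K1, K2 = canonical_member(Lam), canonical_member(Lam2)
gap = max(abs(K1[i][j] - K2[i][j]) for i in range(3) for j in range(3))
print(gap < 1e-9)
```

In this example the phase shifts are small enough that no argument crosses the boundary of $[-\pi, \pi)$, so the two matrices yield the same member with $m = 0$. When an argument does wrap around, other integer vectors $m$ have to be tried, which is the subject of the next section.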

3. Algorithm for constructing the canonical family 

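Now, we consider two nonderogatory upper triangular matrices $\Lambda^{(1)}$ and $\Lambda^{(2)}$
with identical sets of eigenvalues. The eigenvalues are assumed to be identically
ordered on the matrix diagonals. For these matrices, we introduce $r^{(1)}_{ij}$, $\varphi^{(1)}_{ij}$ and
$r^{(2)}_{ij}$, $\varphi^{(2)}_{ij}$ similarly to (2.23). The matrices $\Lambda^{(1)}$ and $\Lambda^{(2)}$ are related by a unitary
similarity transformation if and only if

(i) $r^{(1)}_{ij} = r^{(2)}_{ij}$, $1 \le i < j \le n$, and

(ii) there exist $(\psi_1, \dots, \psi_{n-1}) \in \mathbb{R}^{n-1}$ and integer vectors $k^{(1)}, k^{(2)} \in \mathbb{Z}^{n(n-1)/2}$ such
that, for indices $(ij)$ corresponding to $r^{(1)}_{ij} = r^{(2)}_{ij} > 0$, we have

$$\varphi^{(1)}_{ij} + \psi_i - \psi_j + 2\pi k^{(1)}_{ij} = \varphi^{(2)}_{ij} + 2\pi k^{(2)}_{ij}, \tag{3.1}$$

where $\psi_n = 0$.

This implies that a unitary similarity of $\Lambda^{(1)}$ and $\Lambda^{(2)}$ is equivalent to
$\mathcal{K}(\Lambda^{(1)}, k^{(1)}) = \mathcal{K}(\Lambda^{(2)}, k^{(2)})$ for some integer parameter vectors
$k^{(1)} = (k^{(1)}_{12}, \dots, k^{(1)}_{n-1,n})$ and $k^{(2)} = (k^{(2)}_{12}, \dots, k^{(2)}_{n-1,n})$.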

Let us represent the above criterion in an effective form. Define a subset
$I \subset \mathbb{Z}^{n(n-1)/2}$:

$$I = \{k \in \mathbb{Z}^{n(n-1)/2} : k_{ij} = 0, \pm 1,\ 1 \le i < j \le n-1; \quad k_{in} = 0,\ 1 \le i \le n-1\}. \tag{3.2}$$
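The set $I$ is finite and small enough to enumerate directly for moderate $n$. The sketch below lists its elements; the generator name `index_set` and the dictionary representation keyed by 1-based pairs $(i, j)$ are choices made here for illustration only.

```python
from itertools import product

def index_set(n):
    """Enumerate I from (3.2) as dicts keyed by 1-based pairs (i, j), i < j <= n."""
    free = [(i, j) for i in range(1, n) for j in range(i + 1, n)]  # j <= n - 1
    fixed = {(i, n): 0 for i in range(1, n)}                       # k_in = 0
    for values in product((-1, 0, 1), repeat=len(free)):
        k = dict(zip(free, values))
        k.update(fixed)
        yield k

sizes = {n: len(list(index_set(n))) for n in (2, 3, 4)}
print(sizes)
```

Since there are $(n-1)(n-2)/2$ free components, the set $I$ has $3^{(n-1)(n-2)/2}$ elements, which bounds the size of the canonical family constructed below.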

Theorem 3.1. The matrices $\Lambda^{(1)}$ and $\Lambda^{(2)}$ are unitarily similar if and only if
there exist vectors $k^{(1)}, k^{(2)} \in I$ such that $\mathcal{K}(\Lambda^{(1)}, k^{(1)}) = \mathcal{K}(\Lambda^{(2)}, k^{(2)})$.

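Proof. Let $\Lambda^{(1)}$ and $\Lambda^{(2)}$ be unitarily similar and all their elements above the
diagonal be nonzero. Then, as was shown above, there exist vectors
$(\psi_1, \dots, \psi_{n-1}) \in \mathbb{R}^{n-1}$ and $k^{(1)}, k^{(2)} \in \mathbb{Z}^{n(n-1)/2}$ such that equalities (3.1) hold
for all $(ij)$. We use them to make up the following linear combinations:

$$
\big(\varphi^{(1)}_{ij} - \varphi^{(1)}_{in} + \varphi^{(1)}_{jn}\big) + 2\pi\big(k^{(1)}_{ij} - k^{(1)}_{in} + k^{(1)}_{jn}\big)
= \big(\varphi^{(2)}_{ij} - \varphi^{(2)}_{in} + \varphi^{(2)}_{jn}\big) + 2\pi\big(k^{(2)}_{ij} - k^{(2)}_{in} + k^{(2)}_{jn}\big),
\qquad 1 \le i < j \le n-1. \tag{3.3}
$$

One may see that the $\psi$-dependent terms have canceled out. A feature of these
linear combinations is that they are invariant under the action of the
transformations $X_\psi$ on the linear space of vectors $\varphi = (\varphi_{12}, \dots, \varphi_{n-1,n})$. Moreover, these
combinations form a basis in the subspace of linear functionals invariant under $X_\psi$.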
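Note that the conditions $\varphi^{(p)}_{ij} \in [-\pi, \pi)$ imply $\varphi^{(p)}_{ij} - \varphi^{(p)}_{in} + \varphi^{(p)}_{jn} \in (-3\pi, 3\pi)$,
which in turn implies the following constraints on $k^{(1)}$ and $k^{(2)}$:

$$\big(k^{(1)}_{ij} - k^{(1)}_{in} + k^{(1)}_{jn}\big) - \big(k^{(2)}_{ij} - k^{(2)}_{in} + k^{(2)}_{jn}\big) = 0, \pm 1, \pm 2. \tag{3.4}$$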
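The invariance of the combinations $\varphi_{ij} - \varphi_{in} + \varphi_{jn}$ under the transformations $X_\psi$ of (2.4), on which this argument rests, can be confirmed numerically. The sketch below uses hypothetical helper names (`invariants`, `apply_X`) and randomly drawn test data; it is a sanity check rather than part of the proof.

```python
import random

def invariants(phi, n):
    """phi_ij - phi_in + phi_jn for 1 <= i < j <= n - 1 (1-based keys)."""
    return {(i, j): phi[(i, j)] - phi[(i, n)] + phi[(j, n)]
            for i in range(1, n) for j in range(i + 1, n)}

def apply_X(phi, psi):
    """Transformation (2.4) with psi = (psi_1, ..., psi_{n-1}) and psi_n = 0."""
    full = list(psi) + [0.0]
    return {(i, j): v + full[i - 1] - full[j - 1] for (i, j), v in phi.items()}

random.seed(0)
n = 4
phi = {(i, j): random.uniform(-3, 3)
       for i in range(1, n + 1) for j in range(i + 1, n + 1)}
psi = [random.uniform(-3, 3) for _ in range(n - 1)]

before = invariants(phi, n)
after = invariants(apply_X(phi, psi), n)
drift = max(abs(before[key] - after[key]) for key in before)
print(drift < 1e-12)
```

The shift $\psi_i - \psi_j$ of $\varphi_{ij}$ is compensated by the shifts $-\psi_i$ and $+\psi_j$ coming from $-\varphi_{in}$ and $\varphi_{jn}$, so the drift is zero up to rounding.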

At the same time, the algorithm for deriving the matrix $\mathcal{K}(\Lambda, 0)$ shows that the
arguments $\tilde\varphi_{ij}$ of its elements are linearly expressed in terms of $\varphi_{ij}$:

$$\tilde\varphi_{ij} \in \mathcal{L}(\varphi_{12}, \dots, \varphi_{n-1,n}). \tag{3.5}$$

Moreover, these linear combinations must be invariant under $X_\psi$, so their form
can be refined:

$$\tilde\varphi_{ij} \in \mathcal{L}\big(\{\varphi_{ij} - \varphi_{in} + \varphi_{jn}\},\ 1 \le i < j \le n-1\big). \tag{3.6}$$

Combining this with (3.4), we obtain the sufficiency of verifying the equalities
$\mathcal{K}(\Lambda^{(1)}, k^{(1)}) = \mathcal{K}(\Lambda^{(2)}, k^{(2)})$ for $k^{(1)}, k^{(2)} \in I$.

In the presence of zero elements above the diagonal of $\Lambda^{(1)}$ and $\Lambda^{(2)}$, the
proposition is proved with slight modifications.

The finite set of matrices $\mathcal{K}(\Lambda, k)$, $k \in I$, that are unitarily similar to $\Lambda$ is called
the canonical family of the given matrix.

Thus, the following algorithm is proposed for verifying unitary similarity
between nonderogatory matrices $A$ and $B$ with the same set of eigenvalues:

(i) Reduce these matrices to an upper triangular form with identically ordered
eigenvalues on the diagonal to obtain matrices $\Lambda^{(1)}$ and $\Lambda^{(2)}$:

$$\Lambda^{(1)} = U_1 A U_1^*, \qquad \Lambda^{(2)} = U_2 B U_2^*. \tag{3.7}$$

(ii) For $\Lambda^{(1)}$ and $\Lambda^{(2)}$, construct their canonical families $\mathcal{K}(\Lambda^{(1)}, k^{(1)})$ and
$\mathcal{K}(\Lambda^{(2)}, k^{(2)})$, $k^{(1)}, k^{(2)} \in I$.

(iii) If these families intersect for some $k^{(1)}, k^{(2)} \in I$ and

$$\mathcal{K}(\Lambda^{(1)}, k^{(1)}) = X_1 \Lambda^{(1)} X_1^*, \qquad \mathcal{K}(\Lambda^{(2)}, k^{(2)}) = X_2 \Lambda^{(2)} X_2^*, \tag{3.8}$$

then the original matrices are unitarily similar and

$$B = U A U^*, \qquad U = U_2^* X_2^* X_1 U_1. \tag{3.9}$$

Otherwise, they are not unitarily similar.
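The composition rule (3.9) can be checked on a small example. The sketch below does not run the full algorithm: it constructs $A$ and $B$ so that the reducing matrices $U_1, U_2$ and the diagonal matrices $X_1, X_2$ are known in advance, and then verifies that $U = U_2^* X_2^* X_1 U_1$ indeed satisfies $B = U A U^*$. All names (`chain`, `rotation`, `close`, the sample matrices) are assumptions made for this illustration.

```python
import cmath
import math

def matmul(X, Y):
    n = len(X)
    return [[sum(X[i][k] * Y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def adjoint(X):
    n = len(X)
    return [[X[j][i].conjugate() for j in range(n)] for i in range(n)]

def chain(*matrices):
    """Product of the given matrices, left to right."""
    result = matrices[0]
    for M in matrices[1:]:
        result = matmul(result, M)
    return result

def rotation(t):
    c, s = math.cos(t), math.sin(t)
    return [[c, -s], [s, c]]

def close(X, Y, tol=1e-12):
    return all(abs(X[i][j] - Y[i][j]) < tol
               for i in range(len(X)) for j in range(len(X)))

T = [[1, 2], [0, 3]]                      # common triangular form
V, W = rotation(0.3), rotation(-1.1)      # unitary (real orthogonal) matrices
D = [[cmath.exp(0.8j), 0], [0, 1]]        # diagonal unitary matrix

A = chain(V, T, adjoint(V))
B = chain(W, D, T, adjoint(D), adjoint(W))

U1, U2 = adjoint(V), adjoint(W)           # step (i): triangularizing matrices
Lam1 = chain(U1, A, adjoint(U1))          # equals T
Lam2 = chain(U2, B, adjoint(U2))          # equals D T D*
X1, X2 = [[1, 0], [0, 1]], adjoint(D)     # step (iii): both sides reduce to T
assert close(chain(X1, Lam1, adjoint(X1)), chain(X2, Lam2, adjoint(X2)))

U = chain(adjoint(U2), adjoint(X2), X1, U1)
print(close(B, chain(U, A, adjoint(U))))
```

In an actual run of the algorithm, $U_1$ and $U_2$ would come from a Schur decomposition and $X_1$, $X_2$ from the solutions of system (2.22) for the intersecting members of the two canonical families.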

4. Numerical stability 

The approach presented above differs significantly from earlier approaches to
the problem studied. As a rule, other approaches (e.g., [8, 10]) based on the
Schur upper triangular form tried to create as many positive elements above the
diagonal as possible. But such a property of the desired canonical form inevitably
leads to a form that is unstable with respect to errors in the initial triangular
form. One may observe this effect in the following example (4.1) of a matrix $\Lambda(\varepsilon)$:

% % % 
2 i i 
3 e ' 
4 

where $\varepsilon$ is a complex number. If the initial "strategy" for obtaining the greatest
possible number of positive off-diagonal elements is to start with the superdiagonal
elements, then one can choose a $\Lambda(\varepsilon)$ arbitrarily close (e.g., with respect to the
Frobenius norm) to $\Lambda(0)$ whose canonical form is nevertheless not close to that of $\Lambda(0)$.
The stability property is all the more significant because an upper triangular
form of a matrix is usually obtained by approximate methods (e.g., the
QR algorithm).

From the geometric point of view, the constructed canonical family is a finite
set of ruled surfaces such that the orbit of each nonderogatory matrix
intersects each of them in a single point. The stability of this set of
intersection points follows from the continuity of the quantities (2.21) determined from
system (2.22). This property is of special interest in the context of the
result obtained in [11]. Many ideas used by the author were taken from [12].
Specifically, a minimal continuous extension of the canonical Jordan form was
constructed in [12]. Some of the results presented above are reflected in [13].

I am deeply grateful to Professor Kh.D. Ikramov for his interest in this work 
and helpful discussions. 


